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AMENDMENT TO THE CLAIMS 

1. (Original) A method for creating a task dependent unified 

language model for a selected application from a task 
independent corpus, the task dependent unified language 
model being for use in a language processing system and 
having embedded context-free grammar non- terminal 
tokens in a N-gram model, the method comprising: 

obtaining a plurality of context-free grammars comprising 
non-terminal tokens representing semantic or syntactic 
concepts, each of the context-free grammars having 
words present in the task independent corpus to form 
the semantic or syntactic concepts; 

parsing the task independent corpus with the plurality of 
context-free grammars to identify word occurrences of 
each of the semantic or syntactic concepts; 

replacing each of the identified word occurrences with 
corresponding non- terminal tokens; 

building a N-gram model having the non- terminal tokens; and 

obtaining a second plurality of context-free grammars 
comprising at least some of the same non- terminals 
representing the same semantic or syntactic concepts, 
each of the context-free grammars of the second 
plurality being more appropriate for use in the 
selected application. 

2. (Original) The method of claim 1 and further comprising: 

storing the N-gram model having the non- terminal tokens and 
the second plurality of context-free grammars having 
non- terminal tokens representing task dependent 
semantic or syntactic concepts on a computer readable 
medium. 

3. (Original) A method for creating a task dependent unified 
language model for a selected application from a task independent 
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corpus, the task dependent unified language model being for use in 
a language processing system and having embedded context-free 
grammar non-terminal tokens in a N-gram model, the method 
comprising : 

obtaining a plurality of context-free grammars comprising a 
set of context-free grammars having non- terminal tokens 
representing task dependent semantic or syntactic 
concepts and at least one context-free grammar having a 
non-terminal token for a phrase that can be mistaken 
for one of the desired task dependent semantic or 
syntactic concepts ; 

parsing the task independent corpus with the plurality of 
context-free grammars to identify word occurrences for 
each of the semantic or syntactic concepts and phrases; 

replacing each of the identified word occurrences with 
corresponding non- terminal tokens; and 

building a N-gram model having the non-terminal tokens. 

4. (Original) The method of claim 3 wherein replacing each of the 
identified word occurrences includes excluding the non-terminals 
added for the prevention of mistakes during parsing. 

5. (Original) The method of claim 3 and further comprising: 

storing the N-gram model having the non- terminal tokens and 
the set of context-free grammars having non- terminal 
tokens representing task dependent semantic or 
syntactic concepts on a computer readable medium. 

6. (Original) The method of claim 3 wherein building the N-gram 
model includes eliminating at least some of the associated text 
from the task independent corpus for non- terminal tokens that can 
be mistaken for one of the desired task dependent semantic or 
syntactic concepts . 
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7. (Original) A method for creating a language model for a 
selected application from a task independent corpus, the language 
model being for use in a language processing system, the method 
comprising : 

obtaining a plurality of context-free grammars comprising 
non-terminal tokens representing semantic or syntactic 
concepts of the selected application; 

generating word phrases from the plurality of context-free 
grammars ; 

formulating an information retrieval query from at least one 

of the word phrases; 
querying the task independent corpus based on the query 

formulated; 

identifying associated text in the task independent corpus 

based on the query; and 
building a language model using the identified text. 

8. (Original) The method of claim 7 wherein building a language 
model comprises building a N-gram language model. 

9. (Original) The method of claim 8 and further comprising: 

parsing the identified text of the task independent corpus 
with the plurality of context-free grammars to identify 
word occurrences for each of the semantic or syntactic 
concepts; 

replacing each of the identified word occurrences with 

corresponding non- terminal tokens; and 
wherein building the N-gram language model comprises building 

a N-gram model having the non-terminal tokens. 

10. (Original) The method of claim 8 and further comprising: 

building a second N-gram language model from the word phrases 
generated from the plurality of context-free grammars; 
and 

combining the first -mentioned N-gram language model and the 
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second N-gram language model to form a third N-gram 
language model . 

11. (Original) The method of claim 10 and further comprising: 

parsing the identified text of the task independent corpus 
with the plurality of context-free grammars to identify 
word occurrences for each of the semantic or syntactic 
concepts ; 

replacing each of the identified word occurrences with 
corresponding non-terminal tokens; and 

wherein the word phrases include non- terminal tokens and 
wherein building the first-mentioned N-gram language 
model comprises building a N-gram model having the non- 
terminal tokens . 

12. (Original) The method of claim 9 and further comprising: 

storing the N-gram model having the non- terminal tokens and 
the plurality of context-free grammars having non- 
terminal tokens representing task dependent semantic or 
syntactic concepts on a computer readable medium. 

13. (Original) The method of claim 7 and further comprising: 

storing the identified text of the task independent corpus 
separate from the task independent corpus . 

14. (Original) A method for creating a language model for a 
selected application from a task independent corpus, the language 
model being for use in a language processing system, the method 
comprising : 

obtaining a plurality of context-free grammars comprising 
non-terminal tokens representing semantic or syntactic 
concepts of the selected application; 

generating word phrases from the plurality of context-free 
grammars ; 

building a first N-gram language model from the word phrases; 
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formulating an information retrieval query from at least one 

of the word phrases; 
querying the task independent corpus based on the query 

formulated; 

identifying associated text in the task independent corpus 

based on the query; and 
building a second N-gram language model from the identified 

text; and 

combining the first N-gram language model and the second In- 
gram language model to form a third N-gram language 
model . 

15. (Original) The method of claim 14 wherein building the second 
N-gram language model includes using only the identified text. 

16. (Original) The method of claim 15 and further comprising: 

storing the identified text of the task independent corpus 
separate from the task independent corpus . 

17. (Original) The method of claim 16 and further comprising: 

parsing the identified text of the task independent corpus 
with the plurality of context-free grammars to identify 
word occurrences for each of the semantic or syntactic 
concepts; 

replacing each of the identified word occurrences with 
corresponding non-terminal tokens; and 

wherein the word phrases include non-terminal tokens and 
wherein building the first -mentioned N-gram language 
model comprises building a N-gram model having the non- 
terminal tokens . 

18. (Original) The method of claim 14 and further comprising: 

parsing the task independent corpus with the plurality of 
context-free grammars to identify word occurrences for 
each of the semantic or syntactic concepts; 



-7- 



replacing each of the identified word occurrences with 
corresponding non- terminal tokens; and 

wherein the word phrases include non- terminal tokens and 
wherein building the first -mentioned N-gram language 
model comprises building a N-gram model having the non- 
terminal tokens . 

19. (Original) A method for creating a unified language model for 
a selected application from a corpus, the method comprising: 

obtaining a plurality of context-free grammars comprising 
non-terminal tokens representing semantic or syntactic 
concepts of the selected application; 
building a word language model from the corpus; and 
assigning probabilities to words of at least some of the 
context-free grammars as a function of corresponding 
probabilities obtained for the same words from the word 
language model wherein assigning probabilities includes 
normalizing the probabilities of the words from the 
language model in each of the context-free grammars as 
a function of the words allowed by the corresponding 
context-free grammar. 

20. (Original) The method of claim 19 wherein the word language 
model comprises an N-gram language model. 

21. (Currently Amended) The method of claim i-920 wherein the 
corpus comprises a task independent corpus. 

22. (Original) The method of claim 21 and further comprising: 

generating word phrases from the plurality of context-free 
grammars ; 

formulating an information retrieval query from at least one 

of the word phrases; 
querying the task independent corpus based on the query 

formulated; 
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identifying associated text in the task independent corpus 

based on the query; and 
wherein building a N-gram language model includes using the 

identified text. 



23. (Original) A computer readable medium including instructions 
readable by a computer which, when implemented execute a method to 
build a task dependent unified language model for a language 
processing system, the method comprising: 

accessing a plurality of context-free grammars comprising 
non- terminal tokens representing semantic or syntactic 
concepts, each of the context-free grammars having 
words present in a task independent corpus to form the 
semantic or syntactic concepts; 

parsing the task independent corpus with the plurality of 
context-free grammars to identify word occurrences of 
each of the semantic or syntactic concepts; 

replacing each of the identified word occurrences with 
corresponding non- terminal tokens; 

building a N-gram model having the non- terminal tokens; and 

storing the N-gram model and a second plurality of context- 
free grammars comprising at least some of the same non- 
terminals representing the same semantic or syntactic 
concepts, each of the context-free grammars of the 
second plurality being more appropriate for use in the 
selected application. 

24. (Original) A computer readable medium including instructions 
readable by a computer which, when implemented execute a method to 
build a task dependent unified language model for a language 
processing system, the method comprising: 

accessing a plurality of context-free grammars comprising a 
set of context-free grammars having non- terminal tokens 
representing task dependent semantic or syntactic 
concepts and at least one context-free grammar having a 
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non-terminal token for a phrase that can be mistaken 
for one of the desired task dependent semantic or 
syntactic concepts ; 

parsing a task independent corpus with the plurality of 
context-free grammars to identify word occurrences for 
each of the semantic or syntactic concepts and phrases; 

replacing each of the identified word occurrences with 
corresponding non- terminal tokens; and 

building a N-gram model having the non-terminal tokens. 

25. (Original) The computer readable medium of claim 24 wherein 
replacing each of the identified word occurrences with 
corresponding non- terminal tokens includes excluding the non- 
terminals added for the prevention of mistakes during parsing. 

26. (Original) The computer readable medium of claim 24 having 
instructions further comprising: 

storing the N-gram model having the non- terminal tokens and 
the set of context-free grammars having non- terminal 
tokens representing task dependent semantic or 
syntactic concepts on a computer readable medium. 

27. (Original) The computer readable medium of claim 24 wherein 
building the N-gram model includes eliminating at least some of 
the associated text from the task independent corpus for non- 
terminal tokens that can be mistaken for one of the desired task 
dependent semantic or syntactic concepts. 

28. (Original) A computer readable medium including instructions 
readable by a computer which, when implemented execute a method to 
build language model for a language processing system, the method 
comprising: 

accessing a plurality of context-free grammars comprising 
non-terminal tokens representing semantic or syntactic 
concepts of the selected application; 
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generating word phrases from the plurality of context-free 
grammars ; 

formulating an information retrieval query from at least one 

of the word phrases; 
querying a task independent corpus based on the query 

formulated; 

identifying associated text in the task independent corpus 

based on the query; and 
building a language model using the identified text. 

29. (Original) The computer readable medium of claim 28 wherein 
building a language model comprises building a N-gram language 
model . 

30. (Original) The computer readable medium of claim 29 and 
having instructions further comprising: 

parsing the identified text of the task independent corpus 
with the plurality of context-free grammars to identify 
word occurrences for each of the semantic or syntactic 
concepts; 

replacing each of the identified word occurrences with 

corresponding non- terminal tokens; and 
wherein building the N-gram language model comprises building 

a N-gram model having the non-terminal tokens. 

31. (Original) The computer readable medium of claim 29 and 
having instructions further comprising: 

building a second N-gram language model from the word phrases 
from the plurality of context-free grammars; and 

combining the first -mentioned N-gram language model and the 
second N-gram language model to form a third N-gram 
language model . 

32. (Original) The computer readable medium of claim 31 and 
having instructions further comprising: 
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parsing the identified text of the task independent corpus 
with the plurality of context-free grammars to identify 
word occurrences for each of the semantic or syntactic 
concepts ; 

replacing each of the identified word occurrences with 
corresponding non-terminal tokens; and 

wherein the word phrases include non- terminal tokens and 
wherein building the first -mentioned N-gram language 
model comprises building a N-gram model having the non- 
terminal tokens . 



33. (Original) The computer readable medium of claim 30 and having 
instructions further comprising: 

storing the N-gram model having the non- terminal tokens and 
the plurality of context-free grammars having non- 
terminal tokens representing task dependent semantic or 
syntactic concepts on a computer readable medium. 

34. (Original) The computer readable medium of claim 28 and 
having instructions further comprising: 

storing the identified text of the task independent corpus 
separate from the task independent corpus . 

35. (Original) A computer readable medium including instructions 
readable by a computer which, when implemented execute a method to 
build language model for a language processing system, the method 
comprising : 

accessing a plurality of context-free grammars comprising 
non- terminal tokens representing semantic or syntactic 
concepts of the selected application; 

generating word phrases from the plurality of context-free 
grammars ; 

building a first N-gram language model from the word phrases; 
formulating an information retrieval query from at least one 
of the word phrases; 
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querying a task independent corpus based on the query 
formulated; 

identifying associated text in the task independent corpus 

based on the query; 
building a second N-gram language model from the identified 

text; and 

combining the first N-gram language model and the second N- 
gram language model to form a third N-gram language 
model . 

36. (Original) The computer readable medium of claim 35 wherein 
building the second N-gram language model includes using only the 
identified text. 

37. (Original) The computer readable medium of claim 36 and 
having instructions further comprising: 

storing the identified text of the task independent corpus 
separate from the task independent corpus . 

38. (Original) The computer readable medium of claim 37 and 
having instructions further comprising: 

parsing the identified text of the task independent corpus 
with the plurality of context-free grammars to identify 
word occurrences for each of the semantic or syntactic 
concepts; 

replacing each of the identified word occurrences with 
corresponding non-terminal tokens; and 

wherein the word phrases include non- terminal tokens and 
wherein building the first -mentioned N-gram language 
model comprises building a N-gram model having the non- 
terminal tokens . 

39. (Original) The computer readable medium of claim 35 and 
having instructions further comprising: 

parsing the task independent corpus with the plurality of 
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context-free grammars to identify word occurrences for 
each of the semantic or syntactic concepts; 

replacing each of the identified word occurrences with 
corresponding non- terminal tokens; and 

wherein the word phrases include non-terminal tokens and 
wherein building the first -mentioned N-gram language 
model comprises building a N-gram model having the non- 
terminal tokens . 

40. (Original) A computer readable medium including instructions 
readable by a computer which, when implemented execute a method to 
build a unified language model for a selected application, the 
method comprising: 

accessing a plurality of context-free grammars comprising 
non-terminal tokens representing semantic or syntactic 
concepts of the selected application; 
building a word language model from a corpus; and 
assigning probabilities to words of at least some of the 
context-free grammars as a function of corresponding 
probabilities obtained for the same terminals from the 
word language model wherein assigning probabilities 
includes normalizing the probabilities of the words 
from the word language model in each of the context- 
free grammars as a function of the words allowed by the 
corresponding context-free grammar. 

41. (Original) The computer readable medium of claim 40 wherein 
the word language model comprises an N-gram language model. 

42. (Currently Amended) The computer readable medium of claim 4-0-41 
wherein the corpus comprises a task independent corpus. 

43. (Original) The computer readable medium of claim 42 and 
having instructions further comprising: 

generating word phrases from the plurality of context-free 
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grammars ; 

formulating an information retrieval query from at least one 

of the word phrases; 
querying the task independent corpus based on the query 

formulated; 

identifying associated text in the task independent corpus 

based on the query; and 
wherein building a N-gram language model includes using the 

identified text. 



